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DESCRIPTION 

METHODS AND APPARATUS FOR MAKING AND REPLAYING DIGITAL 
VIDEO RECORDINGS, AND RECORDINGS MADE BY SUCH METHODS 

5 The present invention relates to methods and apparatuses for the 

recording and subsequent playback of digital video data, and to recordings 
made by such methods and apparatuses. The invention finds particular 
application for example in the recording of multiplexed audio/video data 
streams from digital TV broadcasts. Such broadcasts presently adopt a 
10 Transport Stream format in compliance with the MPEG-2 specification (iTU_T 
jii recommendation H.222.0|ISO/IEC 13818-1). Compliance with that 

1^ ^ specification is not essential to the invention, however. 

I T The i\/lPEG-2 specification mentioned above specifies generic methods 

15 for multimedia multiplexing, synchronisation and timebase recovery for use in 
i]j transmitting, recording and reproducing digital video (including audio) 

programmes. The specifications pn^vide a packet based multimedia 
: multiplexing where each elementary bit stream (video, audio, other data) is 

segmented into a Packetised Elementary Stream (PES), and then respective 

20 packets are multiplexed into either of two distinct stream types. Program 
Stream (PS) is a multiplex of variable length PES packets and designed for 
use in error free environments, such as recording on disc. Transport Stream 
(TS) consists of 188 byte fixed length packets, has functionality of multiple 
programme multiplexing as well as multiplexing of various PES packets of one 

25 programme, and is designed for use in error-prone environments such as 
broadcast. The multimedia synchronisation and timebase recovery are 
achieved by time-stamps for system time clock and presentation/decoding. In 
either case, the streams encode audio and especially video data in a 
compressed format, such that it is not possible to enter the stream at an 

30 arbitrary location, and find data valid for decoding. 

o 
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One known system for the distribution of video programmes is Digital 
Versatile Disc (DVD). This standard defines a specific form of MPEG-2 
Program Stream for storage of motion pictures and related material on optical 
disc. While DVD provides presently for the distribution of pre-recorded 

5 material, there is dearly a demand for both commercial and domestic audio 
and/or video (AA/) equipment capable of recording digital AN data from 
broadcasts and home-recorded material, just as the popular video cassette 
tape systems currently in use permit recording of analogue AA/ material. 

Consumers of digital systems will expect equal and in fact superior 

io facilities from a digital recording system, in terms of playing time, random 
access, indexing and so forth. Many proposals have been made to contribute 
to such a system. Mention is made in particular of our co-pending application 
WO 00/28544 (PHN 17161), not published at the priority date of the present 
application) which describes a system of characteristic point information (CPI) 

15 to identify entry and exit point from a recorded stream, without the need to 
parse the stream. Users will expect seamless editing and smooth playback of 
AA/ data. Techniques for seamless editing which can be applied to recordings 
made by the present invention are disclosed in our co-pending International 
Patent application claiming priority from United Kingdom patent application 

20 0007868.3 filed 31'* Mach 2000, and entitled "Methods and Apparatus For 
Editing Digital Video Recordings, and Recordings Made By Such Methods" 
(PHNL 000181), not published at the priority date of the present application, 
and WO-A-00/00981 (PHB 34262 WO). 

As mentioned above, the MPEG specification assumes that PS format 

25 will be used for recorded programmes, while TS is used in the established 
broadcast formats such as DVB. For the purposes of recording programmes 
from broadcasts, therefore, there is the possibility to convert TS to PS for 
recording, and then convert PS to TS for playback via digital channels. Such 
conversion requires demultiplexing and remultiplexing the streams, with regard 

30 to different buffer constraints and the like, which is expensive in terms of 
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computational power and the hardware required for buffering. Moreover, future 
Mnteractive TV' applications may be compromised if domestic recording 
formats adopt a fundamentally different approach to the original broadcast. 

Other proposals have therefore been made to record the broadcast 

5 format, or rather one programme selected from it, "transparently". In these 
solutions, packets corresponding to an elementary stream (an audio or video 
stream) of the required programme are simply recorded in sequence on the 
tape (or disc) with a time-stamp to indicate the time at which they should be 
replayed. Using the time-stamps, the recording device can stream the 

10 recorded programme back to a decoder and play back the broadcast 
programme. As one example, EP-A-0944086 proposes a system for 
recording a selected programme from a broadcast TS. The document in 
particular explains methods for adapting clock reference (PGR) values from 
the full-bandwidth TS multiplex (from satellite, say) into the lesser-bandwidth 

15 stream containing selected programme only of the TS recording being made. 

Unfortunately there are limitations of the transparent TS recording 
approach which become apparent when one adopts a random access 
medium. 

For example, the designer may wish to implement fast-forward (or 
20 reverse) play by skipping rapidly through the sequence of pictures to find and 
extract the 1-pictures only and send them to a decoder. The broadcast format 
(MPEG-2 Transport Stream) gives no guarantee that the packet identification 
field (PID) will be constant during a recording. Thus as we skip through the 
stream we do not know which packets to find unless we examine the stream 
25 sequentially. As another example, it is apparently assumed in EP-A-0944086 
that PGR data is at least included in the data relating to the selected 
programme, when this is not the case. 

It is an object of this invention to enable the provision of methods and 
30 apparatus for the recording of data streams such as MPEG-2 TS while 
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preserving the utility of the popular analogue video recording formats. 

!t is a further and alternative object to realise fully the enhanced 
flexibility of random access storage media such as optical discs, when 
recording digitally from broadcast formats such as DVB. 

5 In particular, the inventors have sought to provide conversion of 

broadcast TS format streams into a format suitable for recording and random 
access, without the burden of conversion to PS format. 

In a first aspect, the invention provides a method of processing data 
received in a transport stream format to produce a modified transport stream 

10 for recording on a recording medium to record the content of a selected 
audio-visual programme (hereinafter referred to as the wanted content), the 
received transport stream comprising a multiplex of elementary streams 
conveying not only the wanted content but also the content of other 
programmes not to be recorded (the unwanted content), said elementary 

15 streams of data having been encoded, divided into elementary stream (ES) 
packets with packet headers, the ES packets further sub-divided into a 
plurality of smaller transport packets, and the transport packets of the first and 
second elementary streams interleaved in the received transport stream with 
each other and with transport packets carrying data from neither stream, 

20 wherein the said modified transport stream is produced by: 

identifying and selecting received transport packets which correspond 
to the wanted content; 

recording the selected transport packets within a recording medium to 
form said modified transport stream, so as to preserve compliance with a 

25 transport stream target decoder model substantially without re-packetising or 
remultiplexing the streams; and 

recording auxiliary information on the same recording medium for use 
by a reproducing apparatus in gaining access to the content of the modified 
transport stream via any of a set of potential entry points throughout the 

30 wanted content. 



5 PHGB 000041 US 

In one embodiment, said auxiliary information includes information 
stored in a file separately from a file holding the modified transport stream, 
and comprises an index listing characteristic points within at least one of the 
wanted elementary streams. This avoids parsing the elementary stream 
5 during playback to identify individual entry points. 

The received transport stream may include stream mapping information 
identifying a transport packet ID code associated with each elementary 
stream, said stream mapping information being used to identify the wanted 
content and being subject to change throughout the received transport stream. 
10 In one such embodiment, current mapping information is inserted in the 
modified stream at each potential entry point. Alternatively, packet IDs may be 
re-mapped by modifying packet headers, so that the recording as a whole 
uses a predefined packet ID for each type of stream (audio, video, teletext 
etc). 

15 A second aspect of the invention provides a method of processing a 

received data stream to produce a modified stream for recording on a 
recording medium to record the content of a selected audio-visual programme 
(hereinafter referred to as the wanted content), the received data stream 
comprising a multiplex of elementary streams conveying not only the wanted 

20 content but also the content of other programmes not to be recorded (the 
unwanted content), the method comprising: 

- identifying and extracting from the received data stream the elementary 
data stream(s) conveying the wanted content; 

- identifying in the received data an elementary stream conveying auxiliary 
25 data useful in playback of the programme to be recorded; 

- in the case where the elementary stream conveying the auxiliary data is 
one conveying part of the unwanted content, generating a new elementary 
stream conveying the auxiliary data without that part of the unwanted 
content; and 

30 - multiplexing together the elementary stream(s) extracted from the received 
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Stream with the new elementary stream to form said modified data stream 
for recording. 

The auxiliary data may for example comprise clock reference data 
(PGR) for synchronising playback of the audio-visual programme. 

The invention in these and other aspects further provides methods of 
reproducing recorded data streams, and apparatuses for recording and 
reproducing such recordings, and recordings and signals as produced by such 
methods. Certain of these aspects and optional features of the invention are 
recited in the appended claims and others will be apparent from a 
consideration of the description of embodiments which follows. 

It will be understood that the Invention is applicable beyond the strict 
confines of MPEG-2 compliant streams, as similar considerations will 
generally arise when recording any coded multiplexed streams, especially 
from broadcast. 

Embodiments of the invention will now be described, by way of example 
only, with reference to the accompanying drawings in which: 

Figure 1 is a block schematic representation of an AN installation 
including novel optical disc record/replay apparatus embodying the present 
invention; 

Figure 2 represents generally the recording of blocks of infonnation in 
sequence areas on an optical disc; 

Figure 3 represents generally the playback of information stored on a 
disc in the process of Figure 2; 

Figure 4 illustrates the structure and key features of the MPEG Transport 
Stream (TS) format, which is the basis of recordings made by the present 
apparatus; 

Figure 5 illustrates various functions and their inter-relationships, in the 
process of making and replaying digital video recordings in the novel 
apparatus; 
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Figure 6 illustrates a method of PID re-mapping in the apparatus; and 
Figure 7 illustrates the replacement of PGR fields from a discarded 
stream in the apparatus. 

Digital Video Recording - General 

5 The following description considers in particular AA/ devices operating 

according to the MPEG standards (ISO/IEC 11172 for MPEG1 and, in 
particular, ISO/IEC 13818 for MPEG2) although the skilled practitioner will 
recognise the applicability of the present invention to other AA/ coding 
schemes not in conformance with the MPEG standard. 

10 Figure 1 shows within the dashed box an embodiment of an apparatus 

suitable to host the present invention, in the form of an optical disc record and 
playback device. In the description of the apparatus, the handling of frame- 
based video signals is concentrated upon, although it will be recognised that 
other types of signal may alternately or additionally be processed, such as 

15 audio or data signals, and that the invention is equally applicable to other 
memory devices such as magnetic data storage tapes and hard disc drives. 

The apparatus comprises an input terminal 1 for receiving a digital 
video signal to be recorded on optical disc 3. Further, the apparatus comprises 
an output terminal 2 for supplying a digital video signal reproduced from the 

20 disc. These terminals may in use be connected via a digital interface such as 
IEEE 1394 to a digital TV receiver and decoder 12 in the form of a "set-top 
box" (STB), which also receives broadcast signals from satellite, cable or the 
like, in MPEG TS format. The decoder provides display signals to a display 
device 14, which may be a conventional analogue TV set. 

25 The data area of the disc 3 consists of a contiguous range of physical 

sectors, having corresponding sector addresses. This address space is 
divided into sequence areas, with a sequence area being a contiguous 
sequence of sectors. The video recording apparatus as shown in Figure 1 is 
decomposed into two major system parts, namely the disc subsystem 6 and 

30 what is referred to herein as the video recorder subsystem 8, controlling both 
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recording and playback. The two subsystems are characterised by a number 
of features, as will be readily understood, including that the disc subsystem 
can be addressed transparently in terms of logical addresses (LA) and can 
guarantee a maximum sustainable bit-rate for reading and/or writing data 

5 from/to the disc. 

Suitable hardware arrangements for implementing such an apparatus 
are known generally to the skilled person, with one example illustrated in the 
above-mentioned WO-A-00/00981 (PHB 34262 WO). The apparatus generally 
comprises signal processing units, a read/write unit including a read/write 

10 head configured for reading from/writing to optical disc 3. Actuators position 
the head in a radial direction across the disc , while a motor rotates the disc. A 
microprocessor is present for controlling all the circuits in known manner. 

Many individual operations must be performed synchronously in order 
to make and reproduce digital video recordings, as will be described in more 

15 detail below. Broadly speaking, signal processing is performed in a manner 
adapted to convert the video data received via the input terminal 1 into blocks 
of information in the channel signal: the size of the blocks of information can 
be variable but may (for example) be between 2MB and 4MB. A block of 
information of the channel signal is written in a sequence area on the disc 3. 

20 The information blocks corresponding to the original video signal are written 
into many sequence areas that are not necessarily contiguous, as may be 
seen in the recording diagram of Figure 3, which arrangement is known as 
fragmented recording. It is a characteristic of the disc sub-system that it is able 
to record and write such fragmented recordings fast enough to meet real-time 

25 deadlines, provided that certain allocation rules are met when the recordings 
are made. 

In order to enable editing of the video data recorded in an earlier 
recording step on the disc 3, the apparatus is further operable to generate the 
bridging sequence to link two video streams, as is described in our co-pending 
30 application claiming priority from GB 0007868.3, mentioned above. 
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Figure 2 illustrates recording of a video signal. In the video recorder 
subsystem 8 of Figure 1, the video signal, which is a real time signal, is 
converted into a real-time file RTF or "clip file" as shown in the upper part of 
Figure 2. The real-time file consists of a succession of signal block sequences 

5 SEQ for recording in corresponding (although fragmented) sequence areas. 
There is no constraint on the location of the sequence areas on the disc and, 
hence, any two consecutive sequence areas comprising portions of data of the 
video signal recorded may be anywhere in the logical address space LAS as 
shown in the lower part of Figure 3. Within each sequence area, real time data 

10 is allocated contiguously. Each real time file represents a single AA/ stream. 
The data of the AN stream is obtained by concatenating the sequence data in 
the order of the file sequence. 

Figure 3 illustrates playback of a video signal recorded on the disc 3. 
Playback of a video signal is controlled by means of a playback control (PBC) 

15 program stored elsewhere in the logical address space. In general, each PBC 
program defines a new playback sequence PBS, which may comprise an 
edited version of recorded video and/or audio segments, and may specify a 
sequence of segments from respective sequence areas. Pointers to the logical 
addresses of the individual sequences making up the desired sequence are 

20 directly or indirectly included in the PBC. As may be seen from comparison of 
Figures 2 and 3, the PBC required to recreate the original file sequence (from 
Figure 2) re-orders the fragmented recorded segments to provide a playback 
frame succession corresponding to the original sequence. 

Transport Stream Format 

25 Figure 4 illustrates the key features and structure of the MPEG-2 

Transport Stream (TS) format. In the apparatus of Figure 1, the data is stored 
on disc 3 in a TS-based format, similar to that in which it is received off-air, 
and in which it is passed over the digital interfaces 1 and 2. The TS is a 
continuous stream of transport packets labelled T-PKT in the drawing, each 

30 comprising 188 bytes of data, and having the format shown at the top of 
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Figure 4. Full details of the MPEG-2 Transport Stream, including Syntax, 
semantics and constraints applicable, will be found in ITU-T recommendation 
H.262 I ISO/IEC 13818-1. Information about the MPEG-2 system is available 
online at http://www.mpeg.org. Briefly, each transport packet includes a 

5 header portion and a payload portion, the payload being indicated as bytes 
DAT-0 to DAT-N in the figure. The header begins with a distinctive 
synchronisation byte SYNC followed by various flags and control fields 
including a transport error indicator TEI, a payload unit start indicator USI, a 
transport priority indicator TPI, a packet identification PID, transport 

10 scrambling control field TSC, adaptation field control AFC and continuity 
counter CC. 

The continuity counter CC is a 4-bit field incrementing with each 
Transport Stream packet for a given PID. CC wraps around to 0 after its 
maximum value. CC is not incremented when the adaptation field control of 

15 the packet indicates no payload for the packet. Continuity counter is a feature 
of the Transport Stream, not the Program Stream format, which is designed to 
allow devices to detect loss of packets in an error-prone channel. Packets 
following a lost packet generally have to be discarded until new 
synchronisation with new headers can be obtained. This makes editing TS 

20 streams more difficult however: the decoder has to be able to distinguish 
between discontinuities deliberately introduced by editing, and those 
introduced by transmission errors. These issues are addressed in the co- 
pending application "Methods and Apparatus For Editing Digital Video 
Recordings, and Recordings Made By Such Methods" (PHNL 000181) 

25 mentioned above. 

Depending on the contents of field AFC, there may be present an 
adaptation field AF, occupying some of the space othenwise allocated to 
payload data. The adaptation field AF may for example contain a discontinuity 
indicator flag as defined in ISO/IEC 13818 for MPEG2. When set to 'r, this 

30 flag indicates that the discontinuity state is true for the current Transport 
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Stream packet. The discontinuity indicator is used to indicate two types of 
discontinuities, system time-base discontinuities and continuity counter 
discontinuities. In addition to optional data fields of pre-defined meaning, tiie 
adaptation field can be padded with stuffing bytes, so as to match the PES 

5 packet ends to TS packet boundaries. 

A system time-base discontinuity is indicated by the use of the 
discontinuity indicator in Transport Stream packets of a PID designated as 
carrying a system time-base in the form of program clock reference PGR (refer 
to ISO/IEC 13818-1 Section 2.4.4.9), Quoting from that specification, when the 

10 discontinuity state is true for a Transport Stream packet of a PID designated 
as a PCR_P1D, the next PGR in a Transport Stream packet with that same PID 
represents a sample of a new system time clock for the associated program. 
The discontinuity indicator shall be set to '1' in the packet in which the system 
time-base discontinuity occurs. The discontinuity indicator bit may also be set 

15 to '1' in Transport Stream packets of the same PGR_PID prior to the packet 
which contains the new system time-base PGR. In this case, once the 
discontinuity indicator has been set to 'T, it shall continue to be set to '1' in all 
Transport Stream packets of the same PGR_PID up to and including the 
Transport Stream packet which contains the first PGR of the new system time- 

20 base. 

In the example of the DVB digital broadcast format, the data rate of the 
TS Stream is around 40 (Mbits/s), while the typical data rate for an audio 
visual programme is less than 10 Mbits/s. Accordingly, as shown at TS in 
Figure 4, various programmes PROG1, PROGS can be multiplexed into a 

25 single transport stream. The field PID of each transport packet indicates one 
elementary stream to which that packet relates, these being interleaved in 
units of transport packets with plentiful other streams. One programme may 
for example comprise of a video stream (PID='055' in the example), an audio 
stream (PID='056') and teletext data stream (PID='057'). The correspondence 

30 between PID values and programmes, and the type of data carried with each 
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PID is maintained in the form of programme specific information (RSI) tables. 
Periodicaily witliin the transport stream a programme association table PAT is 
carried in a special stream of transport packets with PID=0. The PAT in turn 
indicates for PR0G1, PROGS etc., which stream carries a programme 
5 mapping table PMT, which lists completely the different PID values relating to 
the single programme, and describes the content of each one (video, audio, 
alternative language audio, etc.). These tables and other data for control 
purposes are referred to both as system information and programme specific 
information (PSI). 

10 To reproduce or record a given programme (PROG1) from the transport 

stream, the payload DAT-0 to DAT-N of successive transport packets having 
that PID is concatenated into a stream, and this stream carries packetised 
elementary stream packets PES-PKT, which are further defined in the MPEG- 
2 specification. Each PES packet begins with a distinctive packet start code 

15 prefix PSCP. Next in the PES packet header is a stream identifier SID which 
identifies the type of elementary stream (for example video, audio, padding 
stream or private stream). PES packets do not have a fixed length unless 
specified in a particular application, and a PES packet length field LEN 
specifies the number of bytes in the PES packet. Various control and flag 

20 fields C&F then follow, including for example a data alignment indicator DAI 
and a header length field HLEN. Various optional fields are then present 
within the header HDAT, depending on the value of associated flags in the 
C&F field for example, a presentation time stamp PTS may be present 
specifying the time with reference to a system clock at which a "presentation 

25 unit" picture, audio frame etc.) beginning in the present PES packet is due to 
be presented. In certain cases, presentation units are decoded in a different 
order from their presentation order, in which case a decoding time stamp DTS 
may also be present. 

The payload PY-0 to PY-N of successive PES packets having the same 
30 SID forms a continuous elementary stream of data shown schematically at ES 
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in Figure 4. In the case of a video elementary stream ES-VIDEO, various 
picture sequences of clips SEQ are present, each including at its start a 
sequence header SEQH. Various parameters of the decoder including 
quantisation matrices, buffer sizes and the like are specified in the sequence 

5 header. Accordingly, correct playback of the video stream can only be 
achieved by starting the decoder at the location of a sequence header. Within 
the data for each sequence are one or more "access units" of the video data, 
each corresponding to a picture (field or frame depending on the application). 
Each picture is preceded by a picture start code PSC. A group of pictures 

10 GOP may be preceded by a group start code GSC, all following a particular 
sequence header SEQH. 

As is well known, pictures in MPEG-2 and other modern digital formats 
are encoded by reference to one another so as to reduce temporal 
redundancy. Motion compensation provides an estimate of the content of one 

15 picture from the content already decoded for a neighbouring picture or 
pictures. Therefore a group of pictures GOP will typically comprise: an intra- 
coded "1" frame, which is coded without reference to other pictures; two or 
three "P" (predictive) coded pictures which are coded using motion vectors 
based on a preceding I frame; and bi-directional predicted "B" pictures, which 

20 are encoded by prediction from I and/or P frames before and after them in 
sequence. The amount of data required for a B picture is less than that 
required for a P picture, which in turn is less than that required for an I picture. 
On the other hand, since the P and B pictures are encoded with reference to 
other pictures, it is only the 1 pictures which provide an actual entry point for 

25 starting playback of a given sequence. Furthermore, it will be noted that for 
the GOP data, the I and P pictures are encoded before the corresponding B 
pictures, and then re-ordered after decoding so as to achieve the correct 
presentation order. Accordingly, B and P pictures are examples where the 
presentation time stamp PTS and decoding time stamp DTS may differ. 

30 Finally in Figure 4, there is shown a representation of an audio 
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elementary stream ES-AUDIO. This comprises simple frames of data FRM 
with frame start codes. Various audio formats are permitted, varying in terms 
of sample rate (32 kHz, 48 kHz etc.) and also data rate (for example 32 kbits/s 
per second, or variable). These and other properties of the audio and video 

5 streams are encoded in the programme specific information PSl and in the 
PES packet headers. 

Audio frames and video pictures having the same presentation time 
stamp PTS are those which are to be presented simultaneously at the output 
of the decoder. On the other hand, there is great freedom in the scheduling of 

10 packets of data from the different elementary streams, such that audio and 
video access units having the same PTS value can arrive in the transport 
stream TS up to one second apart. 

System Target Decoder 

In order to ensure that buffering and other aspects of a real decoder are 

15 able to decode each type of stream without breaks in the presented audio- 
visual programme, the MPEG-2 standard specifies a transport stream "system 
target decoder" (T-STD) model. There is also a program stream system target 
decoder (P-STD) model. Broadly, the T-STD is a model of a hypothetical real 
decoder having means for de-multiplexing the different elementary streams of 

20 the TS format, having decoders for each of the audio, video and system 
control types of data, and having buffers between the incoming stream and 
the decoder for holding data of each elementary stream between its arrival 
from a data channel and its actual time of decoding and presentation. The 
audio decoder in TS format has a smaller buffer than in the P-STD. Each 

25 main buffer in the T-STD is preceded by a transport buffer which acts to 
smooth the rather "bursty" data in the transport stream itself. While data for a 
given stream may arrive in a burst of several transport packets at a peak rate 
of 40 megabits per second, the average rate of such a stream, when taking 
into account the entire transport stream multiplex, is far lower. A "leak rate" is 

30 defined for the transport buffer so as to throttle the incoming data to a rate of 2 
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megabits per second, assuming that there is data to be passed into the main 
buffer. 

Recording Operations - Modified TS format for DVR 

The basic idea is not to record the broadcast signal transparently in the 
5 form it is broadcast, but instead to reformat into a format suitable for playback 
from the disc. The inventors have recognised that: 

® The processing that has to be done to interpret the broadcast stream is 
sequential in nature. This is because watching TV in real-time is linear and 
10 the streams were designed with this in mind. Recording a signal to disc in 
real-time is also sequential, so all the information needed is available to the 
system at record time. 

® Playback from a disc is potentially non-sequential (due to random access, 
15 fast search, combining two recordings into one new one, skipping parts of 
a recording and so forth). 

« In general, playback may happen many times whereas recording happens 
just once. 

20 

From these observations, the aim of the present system is to do the 
processing needed to make playback and editing easier when the signal is 
recorded, rather than burden all playback and edit operations with 
unnecessary complexity and book-keeping. 

25 Figure 5 illustrates various functions and their inter-relationships, in the 

process of making and replaying digital video recordings in the novel 
apparatus. It will be appreciated that the implementation of the functions and 
interconnections shown may be distributed between hardware and software as 
a matter of design choice. The apparatus will in practice comprise a selection 

30 of dedicated logic circuits and programmable processors for digital signal 
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processing and control. 

Beginning at the start of the recording process in Figure 5, a broadcast 
signal ANT is received and decoded at 500 by a tuner function within the set 
top box (12 in Figure 1). This provides a digital video input signal DVIN in 

5 MPEG-TS format, corresponding to the signal on input terminal 1 in Figure 1. 
Within the recorder apparatus, function 502 selects a desired programme from 
any broadcast TS multiplex, filtering out the packets whose PID does not 
match the desired programme. 

At 504, the filtered stream is parsed into various elementary streams 

10 506 (video), 508 (audio), 510 (graphics) and 512 (system information). The 
video, audio and graphic streams are re-multiplexed at 514 into the real time 
file format for storage on disc 3 via the file system 520 of the disc subsystem 
6. As part of this reformatting operation, lists of characteristic point 
information CPl are generated at 516, and also added to the file structure on 

15 the disc. 

Based on the system information obtained at 512, not to mention 
options selected by the user of the apparatus, various control and indexing 
structure such as a table of contents, playback control programme, edits and 
so forth are generated at 518, for storage on a disc via the file system 520. 

20 For playback, the user (or the system itself) can choose whether to 

obtain digital output signals DVOUT at terminal 2 (Figure 1), or analogue 
signals ANOUT at another terminal 2'. To this end, at 522 the recorded real 
time files can be reproduced, using also the CPl and control information, and 
user input, to reproduce a TS format digital video output at step 522, 

25 Alternatively, at step 524, the same information from the disc can be decoded 
to produce the analogue output signals ANOUT, in accordance with the audio, 
video and graphics content of the recorded stream. 

The novel aspects of the process will now be described in more detail. 

Characteristic Point Information (CPl) 
30 Additional information for identifying and navigating the recording is 
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stored with the PBC or elsewhere on the disk. In particular, characteristic 
point information (CP!) is stored for each real-time file (clip file), as illustrated 
at 516 in Figure 5. CP! comprises tables of locations within the recorded 
streams which are suitable for use as entry points in case of simple editing, 

5 interactive playback and trick play modes (fast picture search, for example). In 
general, CPI is used to determine the location of relevant data elements in a 
clip, without having to read and parse the clip itself. Indexing information like 
this is routinely present in Program Stream-based (PS) pre-recorded material, 
but is not generally provided in broadcast material, based on Transport: 

10 Stream format. More detail of one form of characteristic point information 
(CPI) is contained in our above-referenced co-pending application 
PCT/EP99/08252 (PHN 17161 EP), not published at the present priority date. 
In order to be able to use CPI effectively, the total amount of CPI data should 
be limited. This allows all or most of the CPI data to be stored in system 

15 memory, so that there is no need to write or read CPI data to or from the disc 
during time-critical operations. Actual limitations might be specified for 
example in terms of points, or in terms of the actual size of CPI data on the 
disc. 

For each clip file there is an accompanying CPI sequence, containing a 
20 list of all the characteristic points of that clip file. Clip files may be permitted to 
share data with other clip files to save duplicating data on disc, in a known 
manner. Similarly, CPI sequences can also share points with other CPI 
sequences. 

Transport stream PID mapping 

25 As explained above with reference to Figure 4, within the MPEG-2 

Transport Stream, the desired a programme consists of several elementary 
streams (e.g. Audio and Video and maybe others). Each elementary stream is 
carried in a stream of packets identified by a PID. Which PID number refers to 
each elementary stream is identified in the MPEG Program Specific 

30 Information (PSl) tables. These tables are the program association tables 
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(PAT) and the program map table (PMT). 

Figure 6 illustrates schematically the presence of programme specific 
information RSI, comprising PAT and PMT, interspersed effectively at random 
intervals in among the packets of elementary stream data ES. Recall that PSI 
5 is always carried in a special stream with PID= '000'. The "original" PID of 
each packet, as received in the digital input signal DVIN is shown by a solid 
black circle •. The "original" programme mappings are given periodically by 
PSI fields, marked with a solid black square ■. Every time PSI appears, the 
mapping of PID's to the elementary streams of the desired programme may 
10 change. The PAT/PMT allow the service provider to define and redefine the 
PID numbers that are used to identify program elements at will. A digital TV 
decoder (12 in Figures 1 and 5) has continually to check for updates to the 
PAT/PMT as it decodes the programme. The advantage for the service 
provider of this system is that it makes it easier to insert different content at 
15 different parts of his distribution network without central co-ordination. 

This system works well when the decoder examines the broadcast 
stream sequentially. However, when we store the signal on a disc and then 
later wish to make random access to the stream we have a problem. 
Therefore, even making use of the characteristic point information CPI, merely 
20 jumping to the address in the stream of an l-picture does not work. Instead the 
player has to jump to the PAT and then the PMT sections that precede the I- 
picture to discover what PID mapping is locally in force. Then the player can 
jump to the l-picture address. This will slow down random access 
unacceptably. 

25 Two approaches are now proposed to reformat the data before storing 

it on the disc. As noted above, during the recording all the information needed 
by the recorder to do the reformatting can be easily available. 

In the first approach, (not illustrated) copies of the PAT/PMT are 
inserted immediately before each access unit (in video streams, say, each I- 

30 picture may begin an access unit for this purpose) as it is recorded. This 
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requires a relatively small amount of processing. There are drawbacks to the 
first approach, however, namely: 

® It disturbs the timing of each packet in the MPEG buffer models. This may 
5 require recalculation of a number of parameters in the packet headers. 
® It wastes storage capacity on the disc (but a relatively small amount) 
« The resulting stream still may use many different PIDs. It is not clear how 
practical decoders will behave if the PID is updated very rapidly in 
succession, as may happen if we edit streams together. 

10 

The second approach, illustrated in Figure 6, is to re-map the PIDs for 
every packet to a defined set of PIDs that are used for all recordings. During 
recording, the apparatus keeps track of the current PAT/PMT as usual, in 
order to filter out the unwanted streams. A simple table is established to map 

15 the PIDs specified by the (current) PAT/PMT, to the standard set of PIDs 
defined for recordings. The PID of every transport packet is modified 
according to the table so that the recorded streams have constant PID 
irrespective of changes in the PAT/PMT. The modified PIDs are indicated in 
the drawing by open circles O in the "DVR" format, as recorded on the disc 3. 

20 With the standardised PIDs used for every part of a recording, random 

access is no longer a problem, because, whatever the PAT/PMT (■) in the 
received TS, the wanted PID will always be the same, in the recorded stream. 
Given that the original PAT/PMT («) are still carried within the recorded 
stream, they would conflict with the modified PIDs (O) if fed to the decoder 

25 during playback. Accordingly, during playback, any PAT/PMT included in the 
stream read from the disk are replaced with modified PAT/PMT (open squares 
□ before being output from the apparatus at step 522 or fed to the local 
decoder at 524. 

In a variation on the second approach, the PAT/PMT also can be re- 
30 mapped during recording. Naturally this involves more processing during the 
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recording step, but leaves nothing to do at playback time but feed the streams 
to a standard decoder. 

Advantages seen for the second approach are: 
® There is no impact on buffering strategies. 
5 ® There is not a potential problem with practical decoders. 
® There is no impact on storage capacity. 



Disadvantages of the second approach are: 
® Every packet recorded has to be modified. 
10 ® The PAT and PMT that is transmitted needs to be changed during 
recording or playback, to reflect the new PIDs that are stored. 



The first disadvantage is relatively minor, given that the entire stream is 
being parsed byte-by-byte to identify entry points, create CPI entries and so 
15 forth. A simple single-byte substitution leaves the exact packet structure of 
each stream intact. The second disadvantage goes slightly contrary to the 
ideal of eliminating complicated processing at playback time, but this is a 
relatively minor complication in consideration of the benefits gained overall. 

PCR Replacement 

20 The basic timing information for transport streams is provided by a 

sequence of time stamps known as the program clock reference (PCR). 
Within one multiplex several independent streams of PCRs can be 
transmitted. Programmes within a multiplex can either carry their own PCR or 
share the PCR of another program. PCRs are carried within the packet 

25 header (adaptation field) of a nominated PID. It is possible that the PCR PID 
is not one the elementary streams belonging to a recording (that is, it is one 
being discarded in step 502). 

The apparent consequence of this would be that it is necessary to 
record the PCR PID (as in transparent recording of the entire multiplex). This 
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may lead to the waste of a considerable amount of disc capacity and 
bandwidth, however. As one illustration, suppose that one broadcaster (for 
example company X) includes two programme channels (XI and X2, say) 
within the multiplex, and suppose that channels XI and X2 share a PGR PID. 

5 Now suppose we wish to record X2 audio and video. If the PGR PID for both 
channels is in the elementary stream that carries the vision for XI, for 
transparent recording we would have to record the vision for X1 as well as X2, 
nearly doubling the required storage capacity. 

One solution would be to modify the packet headers of one of the 

10 existing elementary streams (for example, X2 video), so that it can carry the 
PGR. The disadvantage of this approach is that it disturbs the packet 
structure considerably, effectively requiring re-packetisation at the TS level. 
Any disruption of the packet structure potentially has implications for timing 
and buffer constraints. 

15 The solution adopted in the novel apparatus described herein is to 

insert a "dummy" PID, that does not exist in the original broadcast and is 
created solely to support the recording, to carry the PGR. This "dummy" 
stream can have any unused PID, conveniently fixed for ail recordings. 

Figure 7 illustrates the operation of PGR-PID insertion in the apparatus 

20 according to this solution. Elementary streams for XI (video and audio) and 
for X2 (video and audio) are shown as part of the received TS format multiplex 
DVIN. The video stream for X1 labelled "V(X1)" includes periodically the PGR 
field, indicated by a diamond (♦). In the recorded format DVR, when 
recording X2 as described above, both streams V(X1) and A(X1) are omitted, 

25 while streams V(X2) and A(X2) are included in the DVR multiplex. A separate 
stream labelled PGR-PID is added, containing packets only when needed for 
the PGR (^). The saving in data quantity is apparent compared with the 
stream V(X1). The degree of saving will be far greater than appears from the 
drawing, because the drawing exaggerates the ratio of PGR data and audio 

30 payload to video payload for the sake of illustration only. 
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Clearly in the preferred embodiment the behaviour with regard to PGR 
will be combined with the behaviour described above in relation to RID. These 
features are illustrated separately in Figures 6 and 7 for the sake of clarity 
only. The reader will appreciate that these drawings exaggerate the ratio of 
5 RSI and PGR data to audio/video payload and exaggerate the ratio of audio 
payload to video payload, for the sake of clear illustration. 

Alternative Embodiments/Modifications 

As will be generally understood, the preceding examples relate to 
frame-based rather than field-based editing: this is because the general unit of 
10 video encoding in MPEG is the frame. It will be recognised by the skilled 
person that MPEG compliance is not mandatory (as mentioned above) and 
that the techniques described herein may be also applied to non-MPEG field- 
based data. 

There are also defined so-called "incognisant" formats, which may be 
15 broadcast. These follow the TS at the T-packet level, but are coded in non- 
standard fashion or, more likely, they are MPEG streams scrambled for 
security/pay per view. Unless the recorder incorporates the appropriate 
descrambler, these stream cannot be parsed to implement the extra steps 
proposed above. 



